Fast and Exact Top-k Search for Random Walk with Restart
نویسندگان
چکیده
Graphs are fundamental data structures and have been em-ployed for centuries to model real-world systems and phe-nomena. Random walk with restart (RWR) provides a goodproximity score between two nodes in a graph, and it hasbeen successfully used in many applications such as auto-matic image captioning, recommender systems, and link pre-diction. The goal of this work is to find nodes that have top-k highest proximities for a given node. Previous approachesto this problem find nodes efficiently at the expense of exact-ness. The main motivation of this paper is to answer, in theaffirmative, the question, ‘Is it possible to improve the searchtime without sacrificing the exactness?’. Our solution, K-dash, is based on two ideas: (1) It computes the proximityof a selected node efficiently by sparse matrices, and (2) Itskips unnecessary proximity computations when searchingfor the top-k nodes. Theoretical analyses show that K-dashguarantees result exactness. We perform comprehensive ex-periments to verify the efficiency of K-dash. The resultsshow that K-dash can find top-k nodes significantly fasterthan the previous approaches while it guarantees exactness.
منابع مشابه
Reverse Top-k Search using Random Walk with Restart
With the increasing popularity of social networks, large volumes of graph data are becoming available. Large graphs are also derived by structure extraction from relational, text, or scientific data (e.g., relational tuple networks, citation graphs, ontology networks, protein-protein interaction graphs). Node-to-node proximity is the key building block for many graph-based applications that sea...
متن کاملFast Nearest Neighbors in Large and Composite Networks
We address the problem of k Nearest Neighbor (kNN) search in networks using a random walk based proximity measure. Our approach retrieves the exact top neighbors at query time without relying on off-line indexing or summaries of the entire network. This makes it suitable for very large networks, as well as for composite network overlays mixed at query time. We provide scalability and flexibilit...
متن کاملScalable Nearest Neighbors with Guarantees in Large and Composite Networks
We address the problem of k Nearest Neighbor (kNN) search in networks, according to a random walk proximity measure called Effective Importance. Our approach retrieves the exact top neighbors at query time without relying on off-line indexing or summaries of the entire network. This makes it suitable for very large dynamic networks, as well as for composite network overlays mixed at query time....
متن کاملFast Inbound Top-K Query for Random Walk with Restart
Random walk with restart (RWR) is widely recognized as one of the most important node proximity measures for graphs, as it captures the holistic graph structure and is robust to noise in the graph. In this paper, we study a novel query based on the RWR measure, called the inbound top-k (Ink) query. Given a query node q and a number k, the Ink query aims at retrieving k nodes in the graph that h...
متن کاملRestart and Random Walk in Local Search for Maximum Vertex Weight Cliques with Evaluations in Clustering Aggregation
The Maximum Vertex Weight Clique (MVWC) problem is NP-hard and also important in realworld applications. In this paper we propose to use the restart and the random walk strategies to improve local search for MVWC. If a solution is revisited in some particular situation, the search will restart. In addition, when the local search has no other options except dropping vertices, it will use random ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- PVLDB
دوره 5 شماره
صفحات -
تاریخ انتشار 2012